A suggested metric for cepstral ARMA-based speech classification

نویسندگان

  • F. Martínez
  • Antonio Guillamón
  • J. J. Martínez
چکیده

In this paper, we purpose a theoretical development of a metric for speech classification based on cepstral features obtained from ARMA models. Thus working with an ARMA model as a complex rational function, is possible to define a metric d(M,M´) between two stable ARMA models M, M´by means of the cepstrum coefficients of the models. This metric may be calculated algorithmically as a finite sum in the pole-zero domain. We suggest that the metric can be used in at least two circumstances: first, we might a large number of signals that come from various types of pathological sources and we wish to classify them; alternatively, we might the underlying models M i corresponding to several pathological voices and we wish to classify a voice (modeled as M, say) from one of those. In that case, we compute d(M,M i) for each i and we guess the (M i) closest to the model M.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Features for Speech Recognition using Temporal Filtering Technique in the Presence of Impulsive Noise

In this paper we introduce a robust feature extractor, dubbed as Modified Function Cepstral Coefficients (MODFCC), based on gammachirp filterbank, Relative Spectral (RASTA) and Autoregressive Moving-Average (ARMA) filter. The goal of this work is to improve the robustness of speech recognition systems in additive noise and real-time reverberant environments. In speech recognition systems Mel-Fr...

متن کامل

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

A Supervised Text-Independent Speaker Recognition Approach

We provide a supervised speech-independent voice recognition technique in this paper. In the feature extraction stage we propose a mel-cepstral based approach. Our feature vector classification method uses a special nonlinear metric, derived from the Hausdorff distance for sets, and a minimum mean distance classifier. Keywords—Text-independent speaker recognition, mel cepstral analysis, speech ...

متن کامل

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

Improved feature enhancement using temporal filtering in speech recognition

The difference between training and testing environments is the major reason of performance degradation of speech recognition. In this paper, to further decrease the mismatch, we apply temporal filtering, Auto-Regression and Moving-Average (ARMA) filtering or RelAtive SpecTrAl (RASTA) filtering, as a post-processor for the log-Energy dynamic Range Normalization-Cepstral Mean and Variance Normal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003